The Bioperl toolkit: Perl modules for the life sciences.

Authors

  • Jason E Stajich
  • David Block
  • Kris Boulez
  • Steven E Brenner
  • Stephen A Chervitz
  • Chris Dagdigian
  • Georg Fuellen
  • James G R Gilbert
  • Ian Korf
  • Hilmar Lapp
  • Heikki Lehväslaiho
  • Chad Matsalla
  • Chris J Mungall
  • Brian I Osborne
  • Matthew R Pocock
  • Peter Schattner
  • Martin Senger
  • Lincoln D Stein
  • Elia Stupka
  • Mark D Wilkinson
  • Ewan Birney
Abstract

The Bioperl project is an international open-source collaboration of biologists, bioinformaticians, and computer scientists that has evolved over the past 7 years into the most comprehensive library of Perl modules available for managing and manipulating life-science information. Bioperl provides an easy-to-use, stable, and consistent programming interface for bioinformatics application programmers. The Bioperl modules have been successfully and repeatedly used to reduce otherwise complex tasks to only a few lines of code. The Bioperl object model has proven flexible enough to support enterprise-level applications such as EnsEMBL, while maintaining an easy learning curve for novice Perl programmers. Bioperl is capable of executing analyses and processing results from programs such as BLAST, ClustalW, or the EMBOSS suite. Interoperation with modules written in Python and Java is supported through the evolving BioCORBA bridge. Bioperl provides access to data stores such as GenBank and SwissProt via a flexible series of sequence input/output modules, and to the emerging common sequence data storage format of the Open Bioinformatics Database Access project. This study describes the overall architecture of the toolkit and the problem domains that it addresses, and gives specific examples of how the toolkit can be used to solve common life-sciences problems. We conclude with a discussion of how the open-source nature of the project has contributed to the development effort.

Similar resources

The Bio-Community Perl toolkit for microbial ecology

The development of bioinformatic solutions for microbial ecology in Perl is limited by the lack of modules to represent and manipulate microbial community profiles from amplicon and meta-omics studies. Here we introduce Bio-Community, an open-source, collaborative toolkit that extends BioPerl. Bio-Community interfaces with commonly used programs using various file formats, including BIOM, and p...

TFBS: Computational framework for transcription factor binding site analysis

MOTIVATION: TFBS is a set of integrated, object-oriented Perl modules for transcription factor binding site detection and analysis. It implements objects representing specificity profile matrices, binding sites and sets thereof, pattern generators, and pattern database interfaces. The modules are interoperable with the BioPerl open source system. AVAILABILITY AND SUPPLEMENTARY INFORMATION: The ...

On the Ancestral Compatibility of Two Phylogenetic Trees with Nested Taxa

Compatibility of phylogenetic trees is the most important concept underlying widely-used methods for assessing the agreement of different phylogenetic trees with overlapping taxa and combining them into common supertrees to reveal the tree of life. The notion of ancestral compatibility of phylogenetic trees with nested taxa was recently introduced. In this paper we analyze in detail the meaning...

FeatureStack: Perl module for comparative visualization of gene features

SUMMARY: FeatureStack is a Perl module for the automatic generation of multi-gene images. FeatureStack takes BioPerl-compliant gene or transcript features as input and renders them side by side using a user-defined BioPerl glyph. Output images can be generated in SVG or PNG format. FeatureStack comes with a new BioPerl glyph, decorated_gene, which can highlight protein features on top of gene mo...

Integrating Gene Ontology and Blast to predict gene functions

A GoBlast system was built to predict gene function by integrating Blast search with Gene Ontology (GO) annotations. The operating system was Debian Linux 3.1, with Apache as the web server and MySQL as the data storage system. FASTA files with GO annotations were taken as the sequence source for blast alignment and were formatted by the wu-formatdb program. The GoBlast...

Journal:
  • Genome Research

Volume 12, Issue 10

Pages: -

Publication date: 2002